Development of the 1998 OGI-FONIX broadcast news transcription system
نویسندگان
چکیده
In speech recognition systems, it is generally required that the training environment be identical to the decoding environment. Any mismatch between them may result in performance degradation. This paper tries to improve the performance of a speech recognition system by compensating for the training and decoding mismatches. The baseline system [1][2] is a multiple pass decoding system capable of transcribing broadcast news, which achieved 30.5% word error rate on the 1997 DARPA HUB4E test set. Three approaches were investigated: (1) Delete long silence in both training and decoding utterances; (2) Enlarge the second-pass decoding dictionary; (3) Merge utterance fragments into a complete sentence. These approaches resulted in 2.8%, 0.3%, and 2.3% absolute error reductions on the 1997 test set, respectively. The combined approach achieved more than 4% absolute error reduction. On the oÆcial 1998 DARPA HUB4E evaluation, the resulting system achieved 27.9% word error rate for the 97 part evaluation data and 23.6% word error rate for the 98 part evaluation data.
منابع مشابه
The 1998 HTK Broadcast News Transcription System: Development and Results
This paper presents the development of the HTK broadcast news transcription system for the November 1998 Hub4 evaluation. Relative to the previous year’s system The system a number of features were added including vocal tract length normalisation; cluster-based variance normalisation; double the quantity of acoustic training data; interpolated word level language models to combine text sources;...
متن کاملSRI’s 1998 Broadcast News System – Toward Faster, Better, Smaller Speech Recognition
We describe several new research directions we investigated toward the development of our broadcast news transcription system for the 1998 DARPA H4 evaluations. Our goal was to develop significantly faster and smaller speech recognition systems without degrading the word error rate of our 1997 system. We did this through significant algorithmic research creating various new techniques. A sample...
متن کاملThe 1997 HTK Broadcast News Transcription System
This paper presents the recent development of the HTK broadcast news transcription system. Previously we have used data type specific modelling based on adapted Wall Street Journal trained HMMs. However, we are now using data for which no manual preclassification or segmentation is available and therefore automatic techniques are required and compatible acoustic modelling strategies must be ado...
متن کاملThe Development of the 1997 Cmu Spanish Broadcast News Transcription System
This paper describes the 1997 CMU DARPA Hub 4 Spanish Broadcast News Transcription system. The system we present is based on the CMU SPHINX-III recognizer and uses a single set of acoustic and language models. The decoding process is performed in two passes: a Viterbi search and a directed acyclic graph (DAG) search are performed on the first recognition stage. The second recognition stage is s...
متن کاملThe CUHTK-Entropic 10xRT Broadcast News Transcription System
This paper describes the development of the CUHTK-Entropic 10xRT Broadcast News Transcription System. Previous HTK broadcast news transcription systems have focused on maximising accuracy with few constraints on compute power available. In order to develop a system running in under 10 times real time on a single CPU, detailed investigation and optimisation of the system architecture and mode of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999